spider.web.tex 



The Linking Probability of Deep Spider- Web Networks 



Nicholas Pippenger 
njp@princeton . edu 

Department of Computer Science 
Princeton University 

35 Olden Street 
Princeton, NJ 08540 



Abstract: We consider crossbar switching networks with base b (that is, constructed from 
6x6 crossbar switches), scale k (that is, with b k inputs, b k outputs and b k links between 
each consecutive pair of stages) and depth / (that is, with / stages). We assume that the 
crossbars are interconnected according to the spider-web pattern, whereby two diverging 
paths reconverge only after at least k stages. We assume that each vertex is independently 
idle with probability q, the vacancy probability. We assume that b > 2 and the vacancy 
probability q are fixed, and that k and / = ck tend to infinity with ratio a fixed constant 
c > 1. We consider the linking probability Q (the probability that there exists at least 
one idle path between a given idle input and a given idle output). In a previous paper 
it was shown that if c < 2, then the linking probability Q tends to if < q < q c 
(where q c = l/6( c_1 )/ c is the critical vacancy probability), and tends to (1 — £) 2 (where 
£ is the unique solution of the equation (l — q(l — x)) b = x in the range < x < 1) if 
q c < q < 1. In this paper we extend this result to all rational c > 1. This is done by using 
generating functions and complex-variable techniques to estimate the second moments of 
various random variables involved in the analysis of the networks. 



1. Introduction 

We deal in this paper with linking in crossbar switching networks, a phenomenon not 
dissimilar to that of percolation in lattices (as introduced by Broadbent and Hammers- 
ley [B] and surveyed by Grimmett [G]). An important difference, however, is that while 
percolation can be studied in finite subgraphs of a single infinite graph modelling the lat- 
tice, there is no single graph that naturally hosts the graph modelling crossbars switching 
networks we are interested in. Our first order of business will be to describe these graphs. 

A crossbar graph will be characterized firstly by three parameters: its base b > 2, 
its scale k > and its depth I > 0. Its vertices are partitioned into / + 1 ranks, each 
containing b k vertices, which are labelled with the strings of length k over the alphabet 
{0, . . . , b — 1}. The vertices in rank are called inputs, those in rank / are called outputs, 
and those in all other ranks are called links. The edges of the graph are partitioned into / 
stages, each containing b k+1 edges. For 1 < m < I, the edges of stage m are directed out of 
vertices in rank m — 1 and into vertices in rank m. In a spider-web crossbar graph, which 
is our main concern in this paper, there is an edge of stage m from vertex v of rank m — 1 
to vertex w of rank m if and only if v and w are labelled by strings that differ at most in 
position j, where j = m (mod k). The edges of each stage are thus partitioned into b k ~ 1 
b x b complete bipartitite graphs (called crossbars) . The spider-web crossbar graph with 
base b, scale k and depth / will be denoted Gb,h,i- We shall see in Section 2 that if / > k, 
there are b l ~ k paths from a given input to a given output; if I < k, there is at most one 
path from a given input to a given output. Our main interest is in spider-web crossbar 
graphs with / > k, since in these graphs any input can be connected by a path to any 
output; in our analysis, however, graphs with / < k will occur as subgraphs, so it will be 
necessary to allow this case in some of our intermediate results . 

We shall assume that each vertex in the graph Gb,k,i is independently assigned the 
status idle, with probability q (called the vacancy probability), or busy, with the comple- 
mentary probability p = 1 — q (called the occupancy probability). This random assignment 
of a status to each vertex in a graph will be called the state of the graph. Given an input 
v and output w, let Q VjW (called the linking probability) denote the probability that there 
exists a path consisting entirely of idle links from v to w. (In general, the linking proba- 
bility is define as the conditional probability that there exists an idle path, given that v 
and w are themselves idle, but for the probabilisitic model that we are using this condition 
is independent.) We shall see in Section 2 that if I > k, the probability Q VjW does not 
depend on the choice of the input-output pair (v, w), so we shall let Q denote the common 
value of these probabilities. The complementary probability P = 1 — Q (called the blocking 
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probability), is the probability that all paths between a given input-output pair (y, w) are 
broken by a set of busy links. 

In practice, the parameter p represents the amount of traffic being carried by a crossbar 
network (which one would like to maximize), and the parameter P represents the fraction 
of arriving traffic lost due to congestion within the network (which one would like to 
minimize). In analysis, however, it is almost always more convenient to work with the 
complementary parameters q and Q, so we shall work exclusively with these parameters 
in what follows. 

In practice, a graph Gb : k,i would be fixed, and the linking probability Q would be 
studied as a function of the vacancy probability q. It is found that Q undergoes a rapid 
transition from a value near zero to a significantly positive value as q passes through a 
neighborhood of l/fr(' _fe )/(' _1 ). This is easily understood in the following way. 

Let the random variable X VjW denote the number of idle paths from v to w. We shall 
see in Section 2 that if I > k, the distribution of X VjW does not depend on the choice of the 
input-output pair (y,w), so we shall let X denote a random variable with this common 
distribution. Each of the b l ~ k paths from v to w contains / — 1 links, which are all idle 
with probability q l ~ x . Thus we have 

Ex[X]=b l - k q 1 - 1 . (1.1) 

Thus as q passes through l/ftC - *)/^ -1 ), the expected number of idle paths from v to w 
(called the specific transparency) goes from an exponentially decreasing to an exponentially 
increasing function of k and /. This suggests that if k and I tend to infinity in such a way 
that their ratio c = l/k > 1 remains fixed, while b and q are also held fixed, then Q will 
tend to a limit, and that this limit will have a discontinuity as q passes through the critical 
value 

q c = l/6(c-D/c. 

(We note that 1 < c < oo implies 1/b < q c < 1.) Our goal in this paper is to confirm this 
conjecture, and to determine the limiting value of Q. 

Our first step toward this goal, taken in Section 2, will be to derive the following 
estimate for the second moment ExLY 2 ] of X. 

Theorem 1.1: Let b > 2 and 1/b < q < 1 be fixed. Then 

Ex[Y 2 ] = ExLY] • ( (^-) b l ~ k q l+1 + 1 + 0(1 b l ~ 2k q l ) + 0(1 q k )\ 
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as k, I — > oo with / > k and (log(/ + 1)) /(k + 1) — > 0. (The constants in the O-terms may 
depend on 6 and g, but are independent of k and /.) 

We observe that this estimate is enough to establish that the limiting value (if it 
exists) of Q for k — > oo and / = ck, cannot be a continuous function of Q as q passes 
through q c . Indeed, from Markov's inequality and (1.1) we have 

Q = Pr[X > 1] < Ex[X] = b l ~ k q l ~ x -> (1.2) 

for q < q c . On the other hand, (1.1) and Theorem 1.1, together with the inequality 

imply 

(6g - l) 2 



(b-l) 2 q 2 + (bq-l) 2 q 



> (1.4) 



for q = q c . (To verify (1.3), we consider the distribution of X conditioned on the event 
X > 1. Since x 2 is a convex function of x, we have 

Ex[X 2 \X>1}> Ex[X | X > l] 2 . 

Multiplying by Pr[X > l] 2 yields 

Ex[X 2 ] Pr[X > 1] = Ex[X 2 | X > 1] Pr[X > l] 2 > Ex[X | X > l] 2 Pr[X > l] 2 = Ex[X] 2 , 

which is equivalent to (1.3).) The inequalities in (1.2) and (1.4) show that the inferior 
limit of Q for q = q c is strictly greater than the limiting value for q < q c , as claimed. 

The argument of the preceding paragraph also sheds some light on the condition 
(log(/ + l))/(fc + 1) ^0 in Theorem 1.1. (This condition involves k + 1 and / + 1 rather 
than k and / simply to avoid dividing by or taking the logarithm of 0.) This condition 
is not the weakest one sufficient to give an estimate of the form Ex[X 2 ] = 0(Ex[X] 2 ), 
but it is clear that some upper bound on the growth of / must be imposed, for with 
probability (1 — q) all the links in a given rank are busy, disconnecting all input-output 
pairs. Thus if 1/(1 — q) — > oo we have Q — > 0, contradicting the implication of (1.3) when 
Ex[X 2 ] =0(Ex[X] 2 ). 
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In Section 3, we shall combine Theorem 1.1 with branching- process arguments from 
a previous paper, Pippenger [P3], to establish the existence and determine the limiting 
value of Q for q > q c . 

Theorem 1.2: Let 6 > 2 and < q < 1 be fixed, and let c > 1 be rational. Then as k — > oo 
with / = ck, we have 

0, if < q < q c , 



Q 



(1-0 2 , ifg c <?<l, 



where £ is the unique solution of the equation x = (l — q(l — x)) b in the range < x < 1. 

Some comment is in order concerning the behavior of (1 — £) 2 as a function of q. 
The function f(x) = (l — q(l — £)) is a strictly convex function of x for < q < 1, 
since /"(x) = 6(6 — l)(/ 2 (l — (?(1 — x)) 6 2 > in this range. Thus the graph of f(x) can 
intersect the diagonal at most twice in this range. There is one intersection at x = 1, and 
the conditions /(0) = (1 — q) b > and f'(l) = bq > 1 imply that there is at least one 
intersection in the range < x < 1 when 1/6 < q < 1. Thus there is indeed a unique 
solution of the equation x = (l — g(l — in the range < x < 1 when 1/6 < g < 1, 
and this latter condition is implied by q c < q < 1. The degree of this equation can be 
reduced by one (because of the solution x = 1), and it is easy to see that the resulting 
equation is irreducible over the field of rational functions of q; thus £ is an algebraic 
function of q of degree 6 — 1. Since (1 — £) 2 is a polynomial in £, it is also an algebraic 
function of q of degree 6—1. Straightforward analysis shows that Q — > 1 as q — > 1 with 
1 — (5 = 1 — (1— 2 ~ 1 — 2(1 — (/) & , which may be interpreted as saying that the main 
obstacle to linking when q — > 1 is complete occupation of the 6 links adjacent to the input 
in the rank 1, or of the 6 links adjacent to the output in rank I — 1. As q — > 1/6 from 
above, we have (1 - £) 2 ~ (6? - I) 2 /© 2 - 

Theorem 1.2 was proved, under the additional restriction c < 2, by Pippenger [P3], so 
the additional contribution of the current paper consists of lifting this restriction. Never- 
theless, the techniques used in the current paper go considerably beyond those employed 
in the previous paper: the proof of Theorem 1.1 starts with a detailed combinatorial ex- 
amination of the intersections between paths, then uses complex-variable techniques to 
determine the asymptotics of the quantities involved. 

Spider-web networks were introduced by Ikeno [I] (though the name has sometimes 
been used to refer to a broader class of networks) . They have several optimality properties 
among networks constructed from the same type and number of crossbars. Takagi [T] 
show that they have the largest linking probability in a large class of crossbar networks 
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called "rhyming" networks. Chung and Hwang [C] showed, surprisingly, that they are not 
optimal in the larger class of "balanced" networks. But Pippenger [P3] showed that they 
are asymptotically optimal in this class for 1 < c < 2, and the current paper extends this 
result to all c > 1. 

The probability distribution on states that we use was introduced by Lee [LI] and 
Le Gall [L2, L3]. It is by far the easiest to use for analytical purposes, but it suffers the 
defect that the set of busy vertices does not form a set of coherent paths from inputs 
to outputs. Models addressing this defect have been introduced by Koverninskh [K] and 
Pippenger [PI], and the results in Pippenger [P3] have been extended to these models in 
Pippenger [P2]. It seems likely that the results of the the present paper can be similarly 
extended. 

The current paper is self-contained, except for some estimates concerning branching 
processes taken from Pippenger [P3] . We have followed the notation of that paper, except 
that the base, which was denote d in that paper, is now denoted b (to free the symbol al 
for its traditional use in the calculus). 

2. The Second Moment 

Our goal in this section is to prove Theorem 1.1. We begin with a combinatorial result 
concerning spider-web graphs. 

Lemma 2.1: The automorphism group of Gb,k,i acts transitively on the paths from inputs 
to outputs. 

Proof: Since an automorphism must permute the vertices of each rank among themselves, 
an automorphism ■# may be regarded as a sequence = (#ch • • • > $l) of permutations, one 
for each rank. We shall focus on automorphisms in which each $ m (for < m < I) is 
characterized by a string $ m) i • ■ •'&rn ! k of k digits from the alphabet {0, — 1}, and 
acts on the vertices of rank m by carrying the vertex labelled ai • • • to the vertex labelled 
a[ • • • a' k , where a'j = aj + $ m ,j (mod d) for 1 < j < k. If, for 1 < m < I, the string $ m -i 
differs from the string $ m at most in position j, where j = m (mod fc), then the sequence 
$ = (#cb • • • > $l) wm constitute an automorphism. 

To show that the automorphisms act transitively on the paths, it will suffice to show, 
for some fixed path u*, that for every path u, there is an automorphism that carries u* to 
u (since then the inverse of such an automorphism can be used to carry any other path 
u' to u*). A path u may be regarded as a sequence u = (iio, . . . ,ui) of vertex labels in 
which, for 1 < m < I, the string u m -\ differs from the string u m at most in position j, 
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where j = m (mod k). We shall choose for it* the path u* = (0 fe , . . . , k ). Then clearly 
the automorphism $ = (t? , • • • > $/) defined by -d m = u m for < m < I carries u* to u. □ 

Corollary 2.2: If I > k, the graph Gb,k,i contains b l ~ k paths from any given input to any 
given output; if I < k, there is at most one path from any given input to any given output. 

Proof: If I > k, every input-output pair is joined by at least one path, since every position 
in the strings labelling vertices has an opportunity to change at least once. Thus by Lemma 
2.1, every input is joined by the same number of paths. Since each of the b k inputs is the 
origin of b l paths to outputs, there are a total of b l+k paths joining inputs to outputs, and 
thus b l ~ k paths joining each of the b 2k input-output pairs. If I < k, there is a path from 
input v to output w only if the labels of v and w agree in the last k — / positions. Thus 
Gb,k,i breaks into b k ~ l disjoint components, each containing b l vertices in each rank; there 
is a unique path joining input v to out[ut w if they belong to the same component, and 
no path joining them if they belong to different components. □ 

Corollary 2.3: If I > k, the automorphism group of Gb,k,i acts transitively on the input- 
output pairs. 

Proof: If k > k, each input-output pair is joined by a path, so the corollary follows from 
Lemma 2.1. □ 

This corollary, together with the fact that the probability distribution on states of 
the graph is invariant under automorphisms of the graph, justifies our earlier assertion 
that the linking probability Q VyW and the distribution of the random variable X VjW are 
independent of the choice of the input-output pair (v, w) when / > k. Henceforth we shall 
focus attention on the input-output pair (v*,w*) = (0 fc ,0 fc ). If I > k, this entails no loss 
of generality. When I < k, we shall only deal with cases in which the input and output of 
interest are joined by a path, and in these cases there is again no loss of generality. 

Fix b > 2 and k > 1. For / > 0, let (pi(y) denote the generating function for the 
number of paths from the input v* = fc to the output w* = k classified according to the 
number of links that have labels different from fc ; that is, the coefficient of y m in (pi(y) 
is the number of paths from v* to w* that have I — 1 — m links in common with the path 
u* = (0 fc , . . . , k ). Clearly (pi(y) = 1 for < I < k, and (fi(y) is a polynomial in y of degree 
I - 1 if I > k+ 1. 

We are interested in the polynomials tpi(y) for various values of / > 0, with b and k 
fixed. To determine them, it will be convenient to work with a graph Gb t k that contains 
as subgraphs all the graphs Gb,k,i for various values of /. For any m > I > 0, Gb,k,i m ay 
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be regarded as the subgraph comprising the vertices in ranks through / and the edges in 
stages 1 through / of Gb,k,m- Thus we may define the infinite graph 



Gb,k = {J Gb,k,i 

l>k 



as the union (inductive limit) of all these graphs. The graph Gb,k has inputs in rank 0, 
but all other vertices will be referred to links. 

For / > 0, the polynomial <pi(y) is the generating function for the number of paths 
from the input v* = k to the link labelled k in rank / classified according to the number 
of links that have labels different from k . 



be the generating function for the polynomials <pi(y). The key to our estimate for the 
second moment of X is the following proposition. 

Proposition 2.4-' We have 



Proof: In this proof, we shall employ a concise alternative representation of a path u = 
(uo, . . . ,ui) of length / > as a string t = t\ • • of length k + I over the alphabet 
B = {0, . . . , b — 1}. The first k digits t\ • • ■ % of t will be the k digits of the label uq. For 
1 < m < I, tk+m will be the digit in position j of u m , where j = m (mod k) (the digit 
of u m that might be different from that of tt m _i). Then for < m < /, u m is the string 
£ m +i • • -tm+k- In particular, the last k digits of t are the k digits of the label ui of the link 
in rank /, and the paths from the input v* = k to the link labelled k in rank / are in 
one-to-one correspondence with the strings of length k + 1 over the alphabet B whose first 
k digits and last k digits are 0s. 

Given a path t = tk+i ■ • -ti-kO , let us overline each digit tk+ m (1 < m < for 
which u m -i 7^ fc . The result is a string over the alphabet BUB, where B = {0, . . . , b — 1} 
is the set of over lined digits. For / > 0, let the language Ki C (B U B) k+l comprise the 
strings obtained in this way for all paths from the input v* = k to the link labelled fc in 
rank /, and define K C (BUB)* by 



Let 




i>(y,z) 



l-byz + (b-l)(yz) k+1 



(1 -z)(l- byz) - (b - l)z(l - y)(yz) k ' 



K=[jK l . 
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Then ip(y, z) is the power series in y and z in which the coefficient of y J z l is the number 
of strings of length k + I in K in which j digits are over lined. Let 

L = 0~ k K = {t E (B U B)* : k t E K} 

be the language obtained from K by deleting the k initial Os from each string. Since none 
of this initial Os are overlined, ip(y, z) is the power series in y and z in which the coefficient 
of yi z l is the number of strings of length / in L in which j digits are overlined. 

Our next step is to write a regular expression for the language L. Define the alphabets 
B' = {1, . . . , b — 1} and B = {1, . . . , b — 1}. Then L is described by the regular expression 

((A+ (^'(a + O + .-. + o"- 1 ))*^" 1 ^)*, (2.1) 

where A denotes the empty string. To see this, we observe that a string in L can be 

uniquely parsed into into zero or more stretches, each of which ends with an unoverlined 0. 

A stretch consists of an unoverlined optionally preceded by an excursion. An excursion 

consists a final segment preceded by zero or more preliminary segments. A final segment 

consists of a digit from B ' followed by exactly k — 1 overlined Os. A preliminary segment 

consists of a digit from B ' followed by at most k — 1 overlined Os. Clearly a final segment 

z fc— i 

is described by the regular expression B , and a preliminary segment is described by 

the regular expression £? ' (A + H \-0 k 1 ) Thus an excursion is described by the regular 

expression 

(iT(A + 0+-.. + O fe - 1 ))Vo fc -\ 
and a stretch is described by the regular expression 

(A+(5 / (A + + ... + fc - 1 ))*5 / fc - 1 )0. 

Thus the strings in L are described by the regular expression (2.1). 

We now observe that the regular expression (2.1) is unambiguous in the following 
sense: a string described by a subexpression R + S is described by R or by S (but not 
both), a string t described by a subexpression RS has unique parsing t = rs such that r 
is described by R and s is described by 5, and a string t described by a subexpression S* 
has a unique parsing s — s± ■ ■ ■ s n with n ^ such that si, . . . , s n are each described by S. 

For an unambiguous regular expression, if ipn(y,z) and ifis(y,z) are the generating 
functions counting the strings described by subexpressions R and S, respectively, then 
^r{v,z) + ip s (y,z), ip R (y,z)ip s (y,z) and 1/(1 - ip s (y,z)) are the generating functions 
counting the strings described by the subexpressions R + S, RS and £*, respectively. 
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Thus the final segments are counted by the generating function (b — l)(yz) k and the 
preliminary segments are counted by the generating function 

(6 - 1M1 + „* + ...+ to;)*-) = (fr-D^-^r 1 ) . 

1 — yz 

The excursions are counted by 

(b-l)(yz) k _ (b-l)((yz) k -(yz) k+1 ) 



1 _ (b-l)(yz-(yz) k+1 ) l-yz-(b- l)(yz - (yz)^ 1 ) ' 
1 - yz 

and the stretches are counted by 

. (b-l)((yz) k -(yz) k+1 ) \ z - yz i-(b-l)z{yz-{yz) k ) 

1 + ~, 71 TT7 7 \T~TTT Z ~ 



l-yz-(b- l)(yz - (yz) k+1 ) J 1 - yz - (b - l)(yz - (yz) k+1 ) 
Thus the strings in L are counted by 

1 1 -byz + (b- l)(yz) k+1 



_ z - yz 2 - (b - l)z(yz - (yz) k ) (1 - z)(l - byz) - (b - l)z(l - y)(yz) k ' 
l-yz-(b- l)(yz - (yz) k+1 ) 

which completes the proof of the proposition. □ 

Proposition 2.5: Let b > 2 and < q < 1 be fixed. Then as k — > oo and / > behaves in 
such a way that in such a way that (log(/ + 1)) /{k + 1) — > 0, we have 

¥>l(?) = ^~V +1 + 1 + 0(^" 2 V) + 0(lq k ) + 0(lq l ). 

(The constants in the O-terms may depend on b and q, but are independent of k and /.) 

Proof: Write = l-bqz+(b-l)(qz) k+1 and = 

so that ^(g, z) = A(z)/B(z). Then from Cauchy's formula we have 



i/j(q, z) dz 



r z 



r aw dz^ 

Lb(z) z^ [2 - 2) 



2ni Jy B(z) z 

where T is a contour taken counterclockwise around a circle \z\ = e centered at and 
having radius e suficiently small to exclude all other singularities of the integrand. 
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To make further progress, we must estimate the locations of these other singularities, 
which are poles at the values of z for which the denominator B(z) vanishes. One such 
singularity is at z = 1/q. Let 



q V I. 

and let I\ be a contour taken counterclockwise around the circle \z\ = Ci centered at and 
having radius £i- As z traverses this contour, the magnitude of the first term (1 — z) (1—bqz) 
of B{z) satisfies the lower bound 

|(1 - z)(l - bqz)\ = \1- z\ - \l-bqz\ 



q J ql 

since the minimum occurs when z is real and positive. The magnitude of the second term 
(6 — 1)2(1 — q)(qz) k , on the other hand, satisfies the upper bound 



|(6 - 1)2(1 - q){qz) k \ = (b - 1) ( - - lVl - 



fc+i 



< (6-1) (J" 1 ) e ~ k/l 



q J \ el 

Here we have used the inequality 1 — x < e _x , which holds for all x because the graph of 
the convex function e~ x lies above that of 1 — x, its tangent at x = 0, and the inequality 
e~ x < 1 — (e — l)x/e, which holds for < x < 1 because the graph of the convex function 
e~ x lies below that of 1 — (e — l)x/e, its chord across the interval < x < 1. Thus for all 
sufficiently large k (specifically, for k > (bq — l)e/ (6 — 1)(1 — q)(e — 1)), we have the bound 



\b(z)\ = n 



for z on the contour T^. Since we also have A(z) = 0(1) for z on we have the estimate 

1 ^ A{Z) dZ =0(lq l ). (2.3) 



2ivi J Tl B(z) z l+1 

Furthermore, as z traverses the contour Ti, the value of the first term (1 — z)(l — bqz) 
in B(z) encircles the origin twice, since it is a quadratic polynomial. Since the second 
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term (b — l)z(l — q)(qz) k has strictly smaller magnitude, the value of B(z) also encircles 
the origin twice. It follows that the denominator of B(z) has exactly two zeroes inside the 
contour IV These are perturbations of the zeros of the first term: the zero of the first 
term at z = 1 is perturbed to one at 

z = ( 2 = l + 0(q h ), (2.4) 

and the zero of the first term at z = 1/bq is perturbed to one at 

The condition (log(/ + 1)) /(k + 1) -> ensures that the O-terms in (2.4) and (2.5) have 
smaller orders of magnitude than the terms preceding them. We observe that < ^3 < 
C2 < Ci> so that 0, ^3 and C2 he inside Ti, and lie in that order along the real axis. Let 
1?2 be a contour taken counterclockwise around a circle \z — (^| = £ centered at (2 and 
having radius e suficiently small to exclude all other singularities of the integrand, and let 
1?3 be a contour taken counterclockwise around a circle \z — = e centered at (3 and 
having radius e suficiently small to exclude all other singularities of the integrand. Since 
the integral of an analytic function around a contour depends only on the homology class 
of the contour in the domain of analyticity of the function, and since Tq is homologous to 
Ii — T2 — (indeed, Tq is homotopic to a contour that joins a forward traversal of Ti 
with reverse traversals of 1?2 and 1?3 by cancelling traversals of segments [£3 +e, (2 — s] and 
[C2 + £ ? Ci] °f the real axis), from (2.2) we have 



If A(z) dz 
^ {q) = 2^1 f Ti B{z) ^ 

1 I A(z) dz 



2ivi J r2 B(z) z l+1 
1 r A(z) dz 



2ttz J r3 B(z) z l+1 ' 



(2.6) 



The first integral in (2.6) has already been estimated in (2.3). The remaining integrals 
each encircle just one singularity of the integrand, and thus they can be evaluated by 
Cauchy's formula. If £ is a simple pole of the integrand, an V is a contour taken clockwise 
around just this singularity of the integrand, then we have 

1 / A(z) dz n A(z) 1 
' = Res 



2ni J T B(z) z l+1 z=C B{z) z l+1 

MO J_ 
B'(0 C l+1 ' 
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For the integral around T 2 we have A(( 2 ) = ~(bq - 1) + 0(q k ) and B'(( 2 ) = (bq - 1) + 
0(kq k ), so that 

1 r A(y\ r\y 

l + 0(lq k ). (2.7) 



2ni J r2 

For the integral around T 3 we have A(( 3 ) = (b- l) 2 (bq - 1) b k+1 + 0(k/b 2k ) and B'(Cs) = 
-(6g- 1) + 0(/c/6 fc ), so that 



1 if dz 



2 



6 |-fc 5 !+l + 0(/ 6 I-2fcgl)_ (2.8) 



2niJ r3 B(z) z l+1 \bq-l. 

Substituting the estimates (2.3), (2.7) and (2.8) into (2.6) completes the proof of the 
proposition. □ 

We observe that by extending the asymptotic expansions in (2.4) and (2.6), it would 
be possible to extend the expansions in (2.7) and (2.8), and thus reduce their contibutions 
to the error terms in Proposition 2.4. The error term in (2.3), however, cannot be improved 
without taking account of the zeroes of B(z) outside the circle \z\ = 1/q, which will in 
general contribute oscillatory terms to the expansion of (pi(q). 

Proof of Theorem 1.1: By Corollary 2.3, we may take X to be the number of idle paths 
from v* = k to w* = k . We then have 

= Y Pr[« idle, idle] 

u':v* — >u>* u:v* — >w* 

= Pr K idle ] Y Pr t u idle i u ' idle ]' ( 2 - 9 ) 

u':v*— >w* u:v*^w* 

where the sums are over all paths from v* to w*. For each path it', we can find by 
Proposition 2.1 an automorphism $ that carries u' to the the path u* in which all links 
are labelled 0*. Applying this automorphism to both u and u' gives Pr[u idle | u' idle] = 
Pr[$(u) idle | u* idle], since the probability distribution on states of the graph is invariant 
under automorphisms. Furthermore, 



Y P A U idle I u ' idle ] = Y Fr l$( u ) idle I u * idle i 

Pt[u idle | u* idle], 



u:v*^w* u:v*—*w* 



since both right-hand sides sum the same terms in different orders. Thus the inner sum in 
(2.9) is independent of u', and we have 

Ex[x 2 ] = Pi V idle ] Y Fr ^ u idle i u * idle ]' 
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so that Ex[X 2 ] factors as the product of two sums. The first sum is just Ex[X] . To evaluate 
the second sum, we observe that Pr[*u idle | u* idle] is just qi , where j is the number of 
links on u that are not labelled k . Thus the second sum is (pi(q), and the theorem follows 
from Proposition 2.5. □ 

3. The Linking Probability 

Our goal in this section is to prove Theorem 1.2. Thus in this section we shall always 
assume that b > 2 and < q < 1 are fixed, and that k — > oo and / = ck for some fixed 
rational c > 1. Thus the constants in O-tems may depend on c as well as on b and q, but 
not otherwise on or I. We shall also assume that k is even; the case of odd k requires 
only that k/2 be replaced by [k/2\ and \k/2~\ in appropriate ways. 

Lemma 3.1: Let G^ kl be the graph obtained from Gb,k,i by reversing the direction of its 
edges and exchanging the roles of its inputs and outputs. Then G^ kl is isomorphic to 

Gb,k,i- 

Proof: The isomorphism takes the vertex with label ai • • • in rank m of Gb,k,i to the 
vertex with label al ■ ■ ■ a* k in rank I — m of Gl k t , where a* = aj with j = l + l — i (mod k) 
(or vice versa: it is an involution). □ 

Lemma 3.1 establishes a symmetry between Gb,k,i and G* b k t , which we shall invoke 
by use of the term "dually". (When / is even, Gb,k,i is in fact isomorphic to a graph with 
manifest bilateral symmetry, as is shown in the Appendix of Pippenger [P3].) 

Lemma 3.2: Let (Gb,k,i)m,n , with < m < n < /, be the subgraph of Gb,k,i comprising 
the vertices in ranks m (now considered inputs) through n (now considered outputs) and 
the edges in stages m + 1 through n. Then {Gb,k,i)m,n is isomorphic to Gb t k,n-m- 
Proof: The isomorphism takes the vertex with label a\ • • ■ in rank h of Gb,k,n-m to 
the vertex with label a[ ■ ■ ■ a' k in rank m + h of (Gb,k,i)m,n, where a\ = aj with j 
i ■ in (mod k). □ 

Corollary 3.3: Between any given input and any given output of (Gb,k,l)m,n, there are 
ya-m-k p^^g if n — m > /c, and either one path or none if n — m < k. 

Proof: Immediate from Lemmas 3.2 and 2.2. □ 

We begin with the upper bound to Q. For < q < q c , where q c = 1/&( C_1 )/ C , we have 
Q — > by (1.2). For q c < q < 1, we shall use the following lemma from Pippenger [P3] 
(Corollary 4.2). 
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Lemma 3.4: Let T r be a complete balanced 6-ary tree of depth r, and let each vertex of T r 
except for the root be considered idle with probability q independently. Let the random 
variable Z r denote the number of leaves (vertices at depth r) for which every vertex on 
the path from the root (exclusive) to the leaf (inclusive) is idle. Then we have 

Pr[Z r = 0]=£ + O(rf) 

as r — > oo with b > 2 and 1/6 < q < 1 fixed, where £ is the unique solution of the equation 
(l - q(l - £)) 6 = £ in the range < £ < 1, and rj = 6(l - q(l - 0) b_1 < 1- 

Now set r = k/2 and s = I — k/2. The paths from an input v to links in rank r of 
Gb,k,i form a tree isomorphic to T r (if we ignore the directions of the edges), and the paths 
from links in rank s to an output w for a disjoint tree isomorphic to T r . Thus the number 
of links u in rank r for which all the links on the path from v to u are idle is a random 
variable U with the same distibution as Z r . Dually, the number of links u in rank s for 
which all the links on the path from utow are idle is an independent random variable U' 
with the same distribution as Z r . If v and w are linked, then we must have U > 1 and 
U' > 1, so by Lemma 3.1 we have 

Q < Pr[*7 > 1, £T > 1] = (1 - 2 + 0( V r ). 

This completes the upper bound for Theorem 1.2. 

We now turn to the lower bound for Theorem 1.2. Since this result has been proved for 
c < 2 in Pippenger [P3], we shall assume that c > 2. (This assumption could of course be 
avoided, but it would require a more complicated choice of parameters and consideration 
of cases.) For < q < q c , there is nothing to prove, since Q is certainly non-negative. For 
q c < q < 1, we shall use the following lemma from Pippenger [P3] (Lemma 8.1). 

Lemma 3.5: With Z r as in Lemma 3.4 and 1 < H < (bq) r ', we have 

Pr[Z r <H]<£ + 0{{H/(bqy) a ) 

as r — > oo with b > 2 and 1/6 < q < 1 fixed, where a = log(l/?y)/ \og(bq) and 77 is as in 
Lemma 3.4. 

Supposing that q c < q < 1, we shall define 
We observe that q < 1 implies (/* < (/ c -i, and that q c < q implies < q. 
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Lemma 3.6: Let k — > oo and / = c/c, with b > 2, q c < q < 1 and c > 2 fixed. Then for all 
sufficiently large k, we have 

iph(q*) < k 

for all < h < I - k. 

Proof: From Proposition 2.4 we have 

Since q* < q c -i and h < I — k, each term is 0(1), and thus at most k for all sufficiently 
large k. □ 

Let 

We observe that v and w will be linked if the following three events occur. 

I. The input v is joined by paths containing only idle links to all the links in a set V 
containing at least H idle links in rank r. 

II. All the links in a set W containing at least H idle links in rank s are joined y path 
containing only idle links to the output w. 

III. There is at least one path containing only idle links from some link in V to some link 
in W. 

By Lemma 3.5, we have 

Pr[/]>l-^ + 0((^/ ? ) r ), 

and since q* < q we have Pr[J] — > 1 — £. Dually, we have by Lemma 3.5 

Pr[II}>l-^ + 0((qJq) r ), 

and thus also Pr[iT] — > 1 — £. Since events I and II are independent, we have Pr[J, iT] — > 
(1 — £) 2 . Thus to complete the proof of the lower bound for Theorem 1.2, it will suffice to 
show that 

Pr[/// | /, II] -> 1. 

Event III depends on events I and II through the sets V and W. We can avoid having 
to consider this dependence by showing that Prf/iT] — > 1 for any sets V and W each 
containing at least H links. Thus it will suffice to prove the following propostion. 
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Proposition 3. 7: Let V and W be any sets of links in ranks r and s, respectively, each 
containing at least H links. Then 

Pt[III] -> 1 

as — > oo with / = c/c and 6 > 2, c > 2 and g c < g < 1 fixed. 

Proof: Since Pr [III] can only increase if links are added to V or W, we may assume that V 
and W each contain exactly H links. And since Pr [III] can only increase if q is increased, 
it will suffice to estimate Pr [III] assuming the vacancy probability to be q* < q rather 
than q. 

Let the random variable Y be the number of paths containing only idle links joining 
some link in V (exclusive) to some link in W (exclusive). Then event III is equivalent to 
Y > 1, so it will suffice to show that Pr[Y = 0] — > 0. To do this, we shall use Chebyshev's 
inequality: 

Each path from a link in rank r (exclusive) to a link in rank s exclusive contains s — r — 1 = 
I — k — 1 links. Since each of these links is independently idle with probability q*, the 
probability that such a path contains only idle links is q[~ k ~ 1 . By Corollary 3.3, the number 
of such paths joining a given link in rank r with a given link in rank s is b s ~ r ~ k = b l ~ 2k . 
Since there are H links in each of V and W, we have 

Ex[Y] = H 2 b l - 2k q l ~ k - 1 . 
Next we must estimate Var[Y]. We have 

Var[Y] = ( Pr t M ' u ' idle ] ~ Pr t M idle ] Pr [ M ' idle 

u':V^W u:V^W 

= Y Pr K idle ] Yl ( Pr t M idle I u ' idle ] ~~ Pr t w idle 



U ':V->W u:V^W 



Here each sum is over all H 2 paths joining a link in V to a link in W, so there are H 4 
terms in all. If u is a path from a link in rank r to a link in rank s, let g(u) denote the 
link in rank r and <r(u) the link in rank s. By Proposition 2.1, we may assume (as in the 
proof of Theorem 1.1) that u' = u* is part of a path from v* = k through g(u') = k and 
a(u') = 0* to w* = fc , in which all the links have label fc . Thus we have 

Var[Y] = H 2 b l ~ 2k q 1 '^ 1 ( Pr l u idle I u * idle ] ~ Pr l u idle D • 

u:V^W 
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The factor H 2 b l 2k q[ k 1 multiplying the sum is Ex[Y], so to show that Var[F]/Ex[F] 2 — > 
0, it will suffice to show that J/Ex[Y] — > 0, where 

J = ( Fr i u idle I u * idle ] - Fr i u idle D- 

u:V->W 

We now partition the paths u into four classes: 

i. those for which g(u) = cr(u) = fc ; 

ii. those for which g(u) ^ k but a{u) = fc ; 

iii. those for which g{u) = k but a{u) ^ fc ; and 

iv. those for which g{u) ^ fc and a{u) ^ k . 

We shall denote the contributions to J over these four classes by J i? J ii7 Jm and J iv , 
respectively, and estimate them in turn. 

For J i? we have 

Ji < Pr[u idle | u* idle] 

u .Qk^Qk 

< k 



by Lemma 3.6. Thus we have 



Ji < 



Ex[Y] " H 2 b l-2k q i-k-i 

k 



< 



b l k q[ 
0, 



since > q c . 

For Ji;, we have 



Jii < Pr \- U idle I u * idle ]- 

V\{0 k }^0 k 

To estimate Pr [it idle | u* idle], let i be the first rank for which a link in u has label k . 
Since there are two distinct paths in (Gb,k,i)o,i from v* through g(u*) = k and g(u) ^ k 
to this link, we must have i > k + 1 by Corollary 3.3. Thus we have 



ji < (h - 1)1 Yl ?r r -v s - 4 (^) + £ 6 i - r -* 9 *- r_ v.-i(«*) 

y/c+l<i<A;+r fc+r+l<i<s 
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where the factor of H — 1 accounts for the choice of g(u) e V \ {0 fc }, the factors preceding 
ip s -i(Q*) i n the sums account for the probability that all the links on it between ranks r 
(exclusive) and % (exclusive) are idle, the factors of V's-iO?*) account for the probability 
that all the links of it between ranks i and s that are not labelled fc are idle, and we have 
bounded ip s -i(q*) using Lemma 3.6. Bounding the sums by the number of terms (at most 
s — r = l — k) times the largest term (the first for the first sum, and the last for the second), 
we have 

Jii <{H — l)k (I - k){q k J 2 + b l - 2k q l - k - 1 ). 



Thus we have 



Jii < k(l-k)(q k J 2 + b l - 2k q l - k - 1 ) 



Ex[Y] " HU-^q 1 ^- 1 

< HI - k) ( —^z . , 4 



b l-3k/2 q l-k (bqJk/2 

-o, 

since b c ~ z l 2 q c ~ x > b c ~ 3 / 2 q^~ 1 = b~ l / 2 q~ l > b~ x l 2 q^} = 1 (because <?* > q c , b c ~ x q c c = 1, 
q c < q2 and bq 2 = 1) and bq* > 1 (because q* > qoo = !/&)• 

Dually, we have Jiii/Ex[Y] — > 0. 

Finally, for J iv we have 

j iv = ( Fr i u idle i u * idle ] - Pr t u idle ]) 

u:V\{0 k }^W\{0 k } 

i F A u idle I u * idle ] - Pr t w idle ]) 

u:V\{0 k }^W\{0 k } 

< Fr i u idle i u * idle ]' 

u:V\{0 k }^W\{0 k } 

since if it fl it* = 0, the events "it idle" and "w* idle" are independent and the summand 
Pi[u idle | it* idle] — Pr[it idle] vanishes. Given a path it with it fl u* ^ 0, let i be the first 
rank in which u has a link with label fc , and let j > i be the last such rank. As in case ii, 
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we have k + 1 < i, and dually we have j < I — k — 1. Thus we have 

Jw <{H-lf( E Q^-^i-iiQ*)^- 1 

^ k+l<i<k+r l-k-r<j<l-k-l 
i<j 

+ E E ^"Vi-ifeOfr^'"*^'- 1 

fc+l<i<fc+r r<j<l — k— r — 1 
i<j 

+ E E 6 i_r " fc «i" r "Vi-i(9*)«:" i " 1 

fc+r+l<i<s l-k-r<j<l-k-l 

i<j 

+ E E 6 i " r " fc «i" r "Vi-i(«*)6 a " i_ *«r i - 1 ). 

fc+r+l<i<s r<j<l — k— r — 1 

Here we have broken the sum into four parts, according to whether k + l<i<k + ror 
k + r + l<i<s, and independently according to whether I — k — r < j < I — k — lor 
r < j < I — k — r — 1. (We note that the second and third double sums will vanish unless 
c > 5/2, and the fourth double sum will vanish unless c > 3.) The factor of (H — l) 2 
accounts for the choice of g(u) e V \ {0 k } and <t(/u) G W \ {0 fc }, the factors preceding 
ipj-i(q*) in the summands account for the probability that the links of u in ranks less 
than i are idle, the factors or ifjj-i(q*) account for the probability that the links of u 
between i and j and not labelled k are idle, and the factors following tpj-i(q*) in the 
summands account for the probability that the links of u in ranks greater than j are idle. 
Bounding the factors ipj-i(q*) using Lemma 3.6, and bounding each double summation by 
the number of terms (at most (/ — k) 2 ) times the largest term (which occurs for i = k + 1 
and j = I — k — r in the first sum, and for i = j for the remaining three sums), we obtain 

J iv < (H - lfk(l - k) 2 (q k + 2b l - bk / 2 - 1 qi~ k ~ 2 + b l ~ U 'ql^' 2 ) • 
Thus we have 



Ex[y] " v ' \(bq*) l - 2k ^6 fc / 2 +! q*b k 

since bq* > 1, c > 2 and b > 2. This completes the proof of the proposition, and with it 
the proof of Theorem 1.2. □ 
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4. Conclusion 

We have determined the limiting value of the linking probability in spider-web net- 
works with scale k and depth / when / = ck with c > 1. The same method could be used 
when l/k — > oo but (log(/ + l))/(k + 1) — > 0. In this case, the phase transition would be 
less abrupt: the limiting value of Q, and even its first derivative with respect to q, would 
be continuous at the critical value q^ = 1/6, but the second derivative would be discon- 
tinuous. There would be little to be gained by such networks, however, over those with 
a large fixed value of c: Their great cost would decrease the critical vacancy probability 
through only a small interval [q^ , q c ] , and would provide only a small linking probability 
in this interval. 

Another extension of our results would be to consider, instead of the "independent" 
probability distribution on states introduce by Lee [LI] and Le Gall [L2, L3], the "coher- 
ent" distribution introduced by Pippenger [PI]. (The similar distribution introduced by 
Koverninskh [K] does not have an obvious generalization for c > 2, and in any case it does 
not seem likely that the additional independence in Koverninskh's model would have much 
effect on its tractability for c > 2.) 

Yet another line of inquiry would be to consider the computational complexity of path- 
search problems for spider-web networks with c > 2, using the link-probe model introduced 
by Lin and Pippenger [L4]. Some such results were obtained by Pippenger [P4] for c = 2 
(and these results are easily extended to the case 1 < c < 2), but even for c = 2 the known 
results are far from definitive. 
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